Genes for diagnosing colorectal cancer

ABSTRACT

This invention relates to provide the genes for diagnosing colorectal cancer, the gene sequences searching comprise the steps of: (1) deriving epithelium cells from normal intestines, polypus of intestines and colorectal cancer tissue; (2) collecting genes with highly differential gene expression by Suppression Subtractive Hybridization (SSH), and building library; (3) deriving colonies with relatively high signal intensities from cancer tissue; (4) collecting more clinically cancer tissues by Northern Hybridization, real-time Polymerase Chain Reaction (PCR) combined with analysis of bioinformation to affirm variation between differential gene expression; and (5) selecting the most suitable genes from said library, and using the gene sequence as reagent provides the effects of early diagnosis, specificity, highly sensitivity and safety.

REFERENCE CITED

-   1. WO005531

FIELD OF THE INVENTION

This invention relates to genes for diagnosing colorectal cancer, particularly provided a method of clinical diagnosis for colorectal cancer which enables the effects of early diagnosis, specificity, highly sensitivity and safety.

BACKGROUND OF THE INVENTION

Colorectal cancer is one of the most common malignant tumors of the world; it is the second most frequent cause of malignant tumor related mortality in developed countries. In developed countries, mortality rate caused by colorectal cancer seems have a progressively descending tendency in the past 20 years. There are motivations for early diagnosis and for improvement in methods of therapy and medicines. In Taiwan, the rate of suffering for colorectal cancer is rising constantly, furthermore, there is evidence of an age-descending tendency.

According to a 2002 survey by the Department of Health (DOH), the highest level of the executive branch in Taiwan, on the top ten related cancer of Taiwanese population, colorectal cancer (CRC) is the third leading cause of cancer-related death for male and female. About 6681 new cases of colorectal cancer were diagnosed according to statistical data by DOH in 1999, and 3649 patients died in Taiwan due to colorectal cancer according to statistical data by DOH in 2002. The average age of colorectal cancer patient is lower than other countries. In other words, twenty-year-old or thirty-year-old people suffer from the colorectal cancer in Taiwan. Therefore, we can't ignore the possibility of the colorectal cancer occurring in younger populations.

Although methods of diagnosis and surgical operation treatment have improved for colorectal cancer patients, when one makes a comparison between early diagnosis with later period diagnosis by surgical operation, treatment is able to probably overcome colorectal cancer in early diagnosis, but is not able to absolutely overcome colorectal cancer in later period diagnosis. Thus far metastasis are the main problem in the treatment for the colorectal cancer, therefore, a method with high sensitivity, high specificity and easy diagnosis that can detect arly and potentially curable CRC would be a novel target for CRC diagnosis and therapy.

The present invention is to provide functional genetic method, for diagnostic genes of colorectal cancer consist of 71 types of genes, that can be applied for early diagnosing possibility of recurrence and metastasis for colorectal patients. Simultaneously, tracing of 100 colorectal cancer cases have found a 92% genes variation in colorectal tissue. In the process of tracing 100 colorectal cancer cases simultaneously, mutation of genes is found in 92% colorectal cancer tissues. In the tracing process, although CEA of 16 patients remained in normal value range, the method can detect early tumor cells in blood by using genes variation testing.

In WO0055351, ROSEN CRAIG A et. al., entitled “Human Colon Cancer Associated Gene Sequences And Polypeptides”, disclose colon cancer related polynucleotides and the polypeptides encoded by the polynucleotides herein collectively known as “colon cancer antigens”, screening methods for identifying agonists and antagonists of colon cancer antigens of the invention. But, the present invention is to provide SSH and cDNA microarray technology to identify candidate marker genes which are overexpressed continuously from colorectal proliferous polypus to colorectal oncogene, detecting overexpressed genes are selected from up regulation genes which related intently in colorectal cancer oncogene, and down regulation genes which related in colorectal cancer oncogene. The total 71 genes are used to diagnosing early colorectal cancer.

SUMMARY OF THE INVENTION

Therefore, the main purpose according to the present invention is to provide methods of clinical diagnosis for colorectal cancer for early diagnosis, specificity, highly sensitivity and safety.

For the purpose stated above, the method comprises the steps of: (1) deriving epithelium cells from normal intestines, polypus of intestines and colorectal cancer tissue; (2) collecting genes with highly differential gene expression by Suppression Subtractive Hybridization (SSH), and building library; (3) deriving colonies with relatively high signal intensities from cancer tissue; (4) collecting more clinically cancer tissues by Northern Hybridization, real-time Polymerase Chain Reaction (PCR) combined with analysis of bioinformation to affirm variation between differential gene expression; and (5) selecting the most suitable genes from said library. Moreover, the reagent uses the gene sequence as method of clinical diagnosis for colorectal cancer to the early diagnosis.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will be better understood from the following detailed description of preferred embodiments of the invention, taken in conjunction with the accompanying drawings, in which.

Table 1 is a table showing the result of clinical examination of colorectal cancer biochip;

FIG. 1 is a view showing the procedure of deriving genes according to the present invention;

FIG. 2 a and FIG. 2 b are views showing the primary screening according to the present invention;

FIG. 3 a and FIG. 3 b are views showing affirmation to genes using Northern Blotting method according to the present invention;

FIGS. 4 a and 4 b are views showing quantity expression of cancer tissue according to the present invention; and

FIG. 5 is a diagram showing second preferred embodiments according to the present invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The following descriptions of the preferred embodiments are provided to understand the methods and the procedures of the present invention. Please refer to FIG. 1, showing the procedure of searching genes according to the present invention. Said procedure comprise the steps of: (1) deriving epithelium cells from normal intestines, polypus of intestines and colorectal cancer tissue; (2) collecting genes with highly differential gene expression by Suppression Subtractive Hybridization (SSH), and building library; (3) deriving colonies with relatively high signal intensities from cancer tissue; (4) collecting more clinically cancer tissues by Northern Hybridization, real-time Polymerase Chain Reaction (PCR) combined with analysis of bioinformation to affirm variation between differential gene expression; and (5) selecting the most suitable genes from said library. Moreover, by using the gene sequence as a reagent, this enables clinical diagnosis for colorectal cancer to the effects of early diagnosis, specificity, highly sensitivity and safety.

The genes for diagnosing colorectal cancer, the specific oligonucleotides sequence are selected from the group consisting of:

No Hs ID ACC No Discription Definition Oligo sequence 1 Hs.107213 BC027178 FNBP3 Homo sapiens, CATCATAGGAA (SEQUENCE Formin formin binding ACGTTCCCGCT LISTING 72) binding protein 3, clone CTCGATCGGGG protein 3 MGC: 16979 TCAGATTCAGAT IMAGE: 4343048, GATGATG mRNA, (SEQUENCE complete cds LISTING 1) 2 Hs.123107 NM_002257 KLK1 Homo sapiens GCCTTCTGTCG (SEQUENCE Kallikrein 1, kallikrein 1, CCGTCAGAGTG LISTING 73) renal/pancreas/ renal/pancreas/ CTGTCTTATGTG salivary salivary (KLK1), AAGTGGATCGA mRNA. GGACA (SEQUENCE LISTING 2) 3 Hs.1369 NM_000574 DAF Decay Homo sapiens GGGCAGTCAAT (SEQUENCE accelerating decay GGTCAGATATT LISTING 74) factor for accelerating GAAGAGTTCTG complement factor for CAATCGTAGCT (CD55, complement GCGAGGTG Cromer blood (CD55, Cromer (SEQUENCE group blood group LISTING 3) system) system) (DAF), mRNA 4 Hs.151254 NM_005046 KLK7 Homo sapiens TGGAACCACCT (SEQUENCE Kallikrein 7 kallikrein 7 GTACTGTCTCC LISTING 75) (chymotryptic, (chymotryptic, GGCTGGGGCAC stratum stratum TACCACGA corneum) corneum) (SEQUENCE (KLK7), LISTING 4) transcript variant 1, mRNA. 5 Hs.1526 NM_001681 ATP2A2 Homo sapiens CATCGGCATCT (SEQUENCE ATPase, ATPase, Ca++ TCGGGCAGGAT LISTING 76) Ca++ transporting, GAGGACGTGAC transporting, cardiac muscle, GTCAAAAGCTTT cardiac slow twitch 2 CACAG muscle, slow (ATP2A2), (SEQUENCE twitch 2 mRNA LISTING 5) 6 Hs.184270 NM_006135 CAPZA1 Homo sapiens TGACCACTTAC (SEQUENCE Capping capping protein GGAAAGAAGCA LISTING 77) protein actin) (actin filament) AGTGACCCCCA filament) muscle Z-line, GCCAGAAGAAG muscle Z- alpha 1 CAGATG line, alpha 1 (CAPZA1), (SEQUENCE mRNA. LISTING 6) 7 Hs.2043 NM_001151 SLC25A4 Homo sapiens AGATCTTCAAGT (SEQUENCE Solute carrier solute carrier CTGATGGCCTG LISTING 78) family 25 family 25 AGGGGGCTCTA (mitochondrial (mitochondrial CCAGGGTTTCA carrier carrier; adenine ACGTC adenine nucleotide (SEQUENCE nucleotide translocator), LISTING 7) translocator), member 4 member 4 (SLC25A4), nuclear gene encoding mitochondrial protein, mRNA. 8 Hs.267871 NM_005177 ATP6V0A1 Homo sapiens GGACAGAAAGG (SEQUENCE ATPase, H+ ATPase, H+ AATTCAGTGTTT LISTING 79) transporting, transporting, CCTGGTAGTGG lysosomal V0 lysosomal V0 TTGCACTACTGT subunit a subunit a GTGTACCTTGG isoform 1 isoform 1 (SEQUENCE (ATP6V0A1), LISTING 8) mRNA. 9 Hs.4935 D79998 KIAA0176 Human mRNA GGAAAGGATAC (SEQUENCE KIAA0176 for KIAA0176 GGGACAATGAG LISTING 80) protein gene, partial cds AACAGAACTTCA CAAGGCCCCGT GAAGC (SEQUENCE LISTING 9) 10 Hs.5509 NM_006495 EVI2B Homo sapiens GCCCCTGCCAC (SEQUENCE Ecotropic ecotropic viral CAGTAGATTTTA LISTING 81) viral integration site TGAAAAACCAA integration 2B (EVI2B), GAAGATTCCAA site 2B mRNA. CCTTGAGATCC AGTGTC (SEQUENCE LISTING 10) 11 Hs.5662 NM_006098 GNB2L1 Homo sapiens ATGACTGAGCA (SEQUENCE Guanine guanine GATGACCCTTC LISTING 82) nucleotide nucleotide GTGGCACCCTC binding binding protein AAGGGCCACAA protein (G (G protein), beta C protein), beta polypeptide 2- (SEQUENCE polypeptide like 1 (GNB2L1), LISTING 11) 2-like 1 mRNA. 12 Hs.75990 NM_005143 HP Homo sapiens AGGCTGTTGGA (SEQUENCE Haptoglobin haptoglobin GATAAACTTCCT LISTING 83) (HP), mRNA. GAATGTGAAGC AGATGACGGCT GCCCG (SEQUENCE LISTING 12) 13 Hs.83384 NM_006272 S100B S100 Homo sapiens CCGAACTCAAG (SEQUENCE calcium S100 calcium GAGCTCATCAA LISTING 84) binding binding protein, CAATGAGCTTTC protein, beta beta (neural) CCATTTCTTAGA (neural) (S100B), mRNA GGAAATCAAAG AGCAGGAG (SEQUENCE LISTING 13) 14 Hs.10029 NM_001814 CTSC Homo sapiens CACCGGAAAGA (SEQUENCE Cathepsin C cathepsin C AGGTGGGAACT LISTING 85) (CTSC), mRNA GCCTCTGAGAA TGTGTATGTCAA CACAGC (SEQUENCE LISTING 14) 15 Hs. 103982 NM_005409 SCYB11 Homo sapiens GGGCATGGCTA (SEQUENCE Small small inducible TAGCCTTGGCT LISTING 86) inducible cytokine GTGATATTGTGT cytokine subfamily B GCTACAGTTGTT subfamily B (Cys-X-Cys), CAAGGC (Cys-X-Cys), member 11 (SEQUENCE member 11 (SCYB11), LISTING 15) mRNA. 16 Hs.12314 AL049397 Homo Homo sapiens CAACACCACAG (SEQUENCE sapiens mRNA; cDNA ACAGCTGCAGG LISTING 87) mRNA; cDNA DKFZp586C1019 ACTCGATATCCA DKFZp586C1019 (from clone TGGCTTCTTTCC (from DKFZp586C1019) ATCAC clone (SEQUENCE DKFZp586C1019) LISTING 16) 17 Hs.150557 NM_001206 BTEB1 Basic Homo sapiens TTCCACCCCAG (SEQUENCE transcription basic CATGATCAAGC LISTING 88) element transcription GATCGAAAAAG binding element binding GCGCTGGCCAA protein 1 protein 1 CGCTTT (BTEB1), (SEQUENCE mRNA. LISTING 17) 18 Hs.169266 NM_000909 NPY1R Homo sapiens CCGGTCTCGGG (SEQUENCE Neuropeptide neuropeptide Y ATGATGATTATG LISTING 89) Y receptor Y1 receptor Y1 AAACAATAGCC (NPY1R), ATGTCCACGAT mRNA. GCACACAG (SEQUENCE LISTING 18) 19 Hs.1827 NM_002507 NGFR Nerve Homo sapiens CAAGCGGGAGG (SEQUENCE growth factor nerve growth AGGTGGAGAAG LISTING 90) receptor factor receptor CTTCTCAACGG (TNFR (TNFR CTCTGCG superfamily, superfamily, (SEQUENCE member 16) member 16) LISTING 19) (NGFR), mRNA. 20 Hs.1869 NM_002633 PGM1 Homo sapiens GCCAACGGGAT (SEQUENCE Phosphoglucomu- phosphoglucomu- CGGTCGCTTGG LISTING 91) tase 1 tase 1 (PGM1), TTATCGGACAG mRNA. AATGGAATCCT CTCCA (SEQUENCE LISTING 20) 21 Hs.194148 NM_005433 YES1 V-yes- Homo sapiens CAAGTGTGAGC (SEQUENCE 1 Yamaguchi v-yes-1 CATTATGGAGC LISTING 92) sarcoma viral Yamaguchi AGAACCCACTA oncogene sarcoma viral CAGTGTCACCA homolog 1 oncogene TGTCCG homolog (SEQUENCE 1(YES1), mRNA LISTING 21) 22 Hs.2352 X74210 ADCY2 H. sapiens TCGTCTGCTTTG (SEQUENCE Adenylate mRNA for CTGGACAGCTT LISTING 93) cyclase 2 adenylyl cyclase CTGCAATGCAG (brain) CAAAAAAGCCT CTCCC (SEQUENCE LiSTING 22) 23 Hs.246885 NM_017958 FLJ20783 Homo sapiens CCAAGATTCTA (SEQUENCE Hypothetical hypothetical GGACAAACACA LISTING 94) protein protein GCGTATGTGGG FLJ20783 FLJ20783 CTCTGCAGTCA (FLJ20783), TGACCG mRNA. (SEQUENCE LISTING 23) 24 Hs.29665 NM_014944 CLSTN1 Homo sapiens CACGAGCCCTT (SEQUENCE Calsyntenin 1 calsyntenin 1 CTCTGTGACTG LISTING 95) (CLSTN1), AGGATTACCCG mRNA. CTCCATCCATC CAAGAT (SEQUENCE LISTING 24) 25 Hs.3235 NM_002272 KRT4 Keratin Homo sapiens TTCAGCTGTGG (SEQUENCE 4 keratin 4 CTCGGCCATTG LISTING 96) (KRT4), mRNA TAGGCGGTGGC AAGAGAGGT (SEQUENCE LISTING 25) 26 Hs.55209 AF327354 Homo Homo sapiens TAAAGTGGGCT (SEQUENCE sapiens DMR DMR protein CATTGTCATCCC LISTING 97) protein mRNA, CAAGCCAGGCC mRNA, complete cds AGTTCTCCAGG complete cds TGGAA (SEQUENCE LISTING 26) 27 Hs.585 NM_000384 APOB Homo sapiens GCCCAAGGCCA (SEQUENCE Apolipoprotein apolipoprotein B CAGGGGTCCTT LISTING 98) B (including (including Ag(x) TATGATTATGTC Ag(x) antigen) AACAAGTACCA antigen) (APOB), mRNA CTGGG (SEQUENCE LISTING 27) 28 Hs.62187 AF022913 PIGK Homo sapiens TCTTGTCCTTCG (SEQUENCE Phosphati- GPI GCAGCGTGGCC LISTING 99) dylinositol transamidase GCTAGTCATATC glyan, class mRNA, GAGGATCAAGC K complete cds AGAA (SEQUENCE LISTING 28) 29 Hs.63290 NM_012260 HPCL2 2- Homo sapien CATGAACTGCT (SEQUENCE hydroxyphy- 2- GGCCCTTGCTT LISTING 100) tanoyl-CoA hydroxyphy- GTGATTGGTGG lyase tanoyl-CoA lyase TTCCTCTGAAAG (HPCL2), mRNA AAACCAAG (SEQUENCE LISTING 29) 30 Hs.699 NM_000942 PPIB Homo sapiens AGCCGGGATAA (SEQUENCE Peptidylprolyl peptidylprolyl ACCCCTGAAGG LISTING 101) isomerase B isomerase B ATGTGATCATC (cyclophilin (cyclophilin B) GCAGACTGCGG B) (PPIB), mRNA CAAGAT (SEQUENCE LISTING 30) 31 Hs.74111 NM_007367 RALY RNA Homo sapiens AGCGAGGAAGA (SEQUENCE binding RNA binding GCTGGAACACA LISTING 102) protein protein GCCAGGACACA (autoantigen- (autoantigenic, GACGCGGATGA ic, hnRNP- hnRNP- T associated associated with (SEQUENCE with lethal lethal yellow) LISTING 31) yellow) (RALY) transcript variant 2, mRNA 32 Hs.75103 NM_003406 YWHAZ Homo sapiens CGGAAGGTGCT (SEQUENCE Tyrosine 3- tyrosine 3- GAGAAAAAACA LISTING 103) monooxygen- monooxygenase/ GCAGATGGCTC ase/tryptophan /tryptophan 5- GAGAATACAGA 5- monooxygenase GAGAAAATTGA monooxygen- activation GACGG ase activation protein, zeta (SEQUENCE protein, zeta polypeptide LISTING 32) polypeptide (YWHAZ), mRNA 33 Hs.75117 NM_004515 ILF2 Homo sapiens TGACTTCTATTT (SEQUENCE Interleukin interleukin GTGTGAAATGG LISTING 104) enhancer enhancer CCTTTCCCCGG binding factor binding factor 2, GTCAAGCCAGC 2, 45 kD 45 kD (ILF2), ACCTG mRNA (SEQUENCE LISTING 33) 34 Hs.75236 NM_021952 ELAVL4 Homo sapiens GCACCATGGAG (SEQUENCE ELAV ELAV CCTCAGGTGTC LISTING 105) (embryonic (embryonic AAATGGTCCGA lethal, lethal, abnormal CATCCAATACAA abnormal vision, GCAATG vision, Drosophila)-like (SEQUENCE Drosophila)- 4 (Hu antigen D) LISTING 34) like 4 (Hu (ELAVL4), antigen D) mRNA 35 Hs.75258 NM_004893 H2AFY H2A Homo sapiens CACCGAAGCCA (SEQUENCE histone H2A histone GGAAGCCCCGT LISTING 106) family, family, member TTGTAAGCGTG member Y Y (H2AFY), TGTTGTGGTGC transcript variant TTTATT 2, mRNA (SEQUENCE LISTING 35) 36 Hs.75498 NM_004591 SCYA20 Homo sapiens GCTACTCCACC (SEQUENCE Small small inducible TCTGCGGCGAA LISTING 107) inducible cytokine TCAGAAGCAGC cytokine subfamily A AAGCAACTTTGA subfamily A (Cys—Cys), CTGCT (Cys—Cys), member 20 (SEQUENCE member 20 (SCYA20), LISTING 36) mRNA 37 Hs.76913 NM_002790 PSMA5 Homo sapiens GTTTCTTACCCG (SEQUENCE Proteasome proteasome GTCTGAGTACG LISTING 108) (prosome, (prosome, ACAGGGGCGTG macropain) macropain) AATACTTTTTCT subunit, subunit, alpha CCCG alpha type, 5 type, 5 (SEQUENCE (PSMA5), LISTING 37) mRNA 38 Hs.79889 NM_012329 MMD Homo sapiens GCTATGAACAT (SEQUENCE Monocyte to monocyte to GCTGCTAACTG LISTING 109) macrophage macrophage TTACACACACG differentiation- differentiation- CATTCCTCATTG associated associated TTCCGGCC (MMD), mRNA (SEQUENCE LiSTING 38) 39 Hs.82173 NM_005655 TIEG TGFB Homo sapiens TTTGTGGTACC (SEQUENCE inducible TGFB inducible CCAGCCCGTTG LISTING 110) early growth early growth TGCAGAGTTCA response response AAGCCTCCGGT (TIEG) Mrna G (SEQUENCE LISTING 39) 40 Hs.84072 NM_004616 TM4SF3 Homo sapiens GCAATGACTCT (SEQUENCE Transmembrane transmembrane CAAGCAATTTTT LISTING 111) 4 4 superfamily GGTTCTGAAGA superfamily member 3 TGTAGGCTCTA member 3 (TM4SF3), GCTCCTACGTT mRNA GCTGTG (SEQUENCE LISTING 40) 41 Hs.85146 NM_005239 ETS2 V-ets Homo sapiens CTCATGACTCC (SEQUENCE erythroblastosis v-ets GCCAACTGTGA LISTING 112) virus E26 erythroblastosis ATTGCCTTTGTT oncogene virus E26 AACCCCGTGCA homolog 2 oncogene GCAAG (avian) homolog 2 (SEQUENCE (avian) (ETS2), LISTING 41) mRNA 42 Hs.85844 NM_002529 NTRK1 Homo sapiens TTCATGGACAA (SEQUENCE Neurotrophic neurotrophic CCCTTTCGAGTT LISTING 113) tyrosine tyrosine kinase, CAACCCCGAGG kinase, receptor, type 1 ACCCCATCCCT receptor, type (NTRK1), mRNA GTCT 1 (SEQUENCE LISTING 42) 43 Hs.88219 NM_003454 ZNF200 Zinc Homo sapiens CCCAGTCAGAA (SEQUENCE finger protein zinc finger AGTCAAGGAGA LISTING 114) 200 protein 200 CCTTGGTTATTA (ZNF200), TGAAAGATGTG mRNA AGCTCAAGCCT TCAGAACAG (SEQUENCE LISTING 43) 44 Hs.9914 NM_006350 FST Homo sapiens CCCTGACAGTA (SEQUENCE Follistatin follistatin (FST), AGTCGGATGAG LISTING 115) transcript variant CCTGTCTGTGC FST317, mRNA CAGTGACAATG CCACTT (SEQUENCE LISTING 44) 45 Hs.169319 NM_003419 ZNF345 Zinc Homo sapiens CAGGGATCTCA (SEQUENCE finger protein zinc finger GGAAGGACATT LISTING 116) 345 protein 345 TCAGTGAAATG (ZNF345), ATATTTACTCCT mRNA GAAGACATGCC CACTTTCAG (SEQUENCE LISTING 45) 46 Hs.72805 NM_030921 DC42 Homo sapiens GGCATGGCAGC (SEQUENCE Hypothetical hypothetical AAATGCCAACAT LISTING 117) protein DC42 protein DC42 TTTGTGGAATAG (DC42), mRNA CAGCAAATCTA CAAGAGACCCT GG (SEQUENCE LISTING 46) 47 HS.108301 NM_003297 NR2C1 Homo sapiens GACACCTACAG (SEQUENCE Nuclear nuclear receptor GTTATCCAGACT LlSTING 118) receptor subfamily 2, ACTACTCAGATT subfamily 2, group C, GCCAGCTTTAA group C, member 1 GACTGATGAAT member 1 (NR2C1), mRNA GCTACCATC (SEQUENCE LISTING 47) 48 Hs.177926 NM_030941 LOC81691 Homo sapiens CCCAGTGACGA (SEQUENCE Exonuctease exonuclease CCAAACTCAAA LISTING 119) NEF-sp NEF-sp GATGTACAGAG (LOC81691), GCAGTTAAAAG mRNA CACTGCTTCCT C (SEQUENCE LISTING 48) 49 Hs.194746 NM_018896 CACNA1G Homo sapiens ACGTCAGAGAT (SEQUENCE Calcium calcium TGTGTCTGAAC LISTING 120) channel, channel, CGTCCTGCTCT voltage- voltage- CTAGCTCTGAC dependent, dependent, GGATGA alpha 1G alpha 1G (SEQUENCE subunit subunit LISTING 49) (CACNA1G), mRNA 50 Hs.209061 NM_003831 SUDD SudD Homo sapiens TCACGGCCTGG (SEQUENCE suppressor of sudD AGTTCTTGTTCC LISTING 121) bimD6 suppressor of GGGACTGCAGG homolog (A. bimD6 homolog AATGTCTCGCA nidulans) (A. nidulans) GTT (SUDD), mRNA (SEQUENCE LISTING 50) 51 Hs.25087 NM_006070 TFG TRK- Homo sapiens TAATCCTTATGC (SEQUENCE fused gene TRK-fused gene GCGTAACCGTC LISTING 122) (TFG), mRNA CTCCCTTTGGT CAGGGCTATAC CCAAC (SEQUENCE LISTING 51) 52 Hs.3017 NM_003284 TNP1 Homo sapiens GATCAAAGCCA (SEQUENCE Transition transition protein GAGAGGAGCCT LISTING 123) protein 1 1 (during ATGGAATGTGG (during histone to ATCAAATGCCA histone to protamine GTTGTGACG protamine replacement) (SEQUENCE replacement) (TNP1), mRNA LISTING 52) 53 Hs.283664 NM_032466 ASPH Homo sapiens GAACCACAACA (SEQUENCE Aspartate aspartate beta- AGAGGATGATG LISTING 124) beta- hydroxylase AGTTTCTTATGG hydroxylase (ASPH), CGACTGATGTA transcript variant GATGATAGATTT 3, mRNA GAGACCCTGG (SEQUENCE LISTING 53) 54 Hs.283664 NM_032467 ASPH Homo sapiens CTCAGGGAGAT (SEQUENCE Aspartate aspartate beta- GGATTTGCTCG LISTING 125) beta- hydroxylase TTGTTTTCTTCC hydroxylase (ASPH), CTCCTTCCCCTT transcript variant CCTG 4, mRNA (SEQUENCE LISTING 54) 55 Hs.171992 NM_002843 PTPRJ Homo sapiens CCGTGGATGTG (SEQUENCE Protein protein tyrosine TATGGGATTGT LISTING 126) tyrosine phosphatase, GTATGACCTTC phosphatase, receptor type, J GAATGCATAGG receptor type, (PTPRJ), mRNA CCTTTAATGGTG J C (SEQUENCE LISTING 55) 56 Hs.155172 NM_003664 AP3B1 adaptor-related GCCCAGCTTAT (SEQUENCE protein complex CATAAACACTGA LISTING 127) 3, beta 1 subunit GAAAACTGTGA TTGGCTCTGTTC TGCTGCGGG (SEQUENCE LISTING 56) 57 Hs.183418 M37712 CDC2L2 cell dividion CGAGAAAATGA (SEQUENCE cycle2-like2 AAACCACCTCTT LISTING 128) GGTTGTTCCAG AGTCACGGTTC GACCGAG (SEQUENCE LISTING 57) 58 Hs.244473 NM_031900 AGXT2 alanine- TCCGGGATTGT (SEQUENCE glyoxylate TACTGTCAGTGT LISTING 129) aminotransferase TGGCCATTGCC 2 ACCCAAAGGTG AATGC (SEQUENCE LISTING 58) 59 Hs.12835 NM_004842 AKAP7 A kinase GAGCCCGATGA (SEQUENCE (PRKA) anchor CGCTGAACTAG LISTING 130) protein 7 TAAGGCTCAGT AAGAGGCTGGT GGAGAA (SEQUENCE LISTING 59) 60 Hs.1650 NM_000111 SLC26A3 solute carrier TCAGCCCCCTA (SEQUENCE family 26, TTACACCTGAC LISTING 131) member 3 GTGGAGACTTT CCAAAACACCG TAGGAG (SEQUENCE LISTING 60) 61 Hs.29981 NM_000112 SLC26A2 solute carrier CAGCAGGGATC (SEQUENCE family 26 CACACACTGAA LISTING 132) (sulfate AGAAGTTCGCA transporter), GAGATTATGAA member 2 GCCATTGGAAT CC (SEQUENCE LISTING 61) 62 Hs.2246 NM_001308 CPN1 carboxypeptidase TCAAGTAAGCC (SEQUENCE N, polypeptide CTGTGAGGAGA LISTING 133) 1, 50 kD GCTCCCAGCAG AAGGCACGGAG T (SEQUENCE LISTING 62) 63 Hs.267871 NM_005177 ATP6V0A1 ATPase, H+ AAATGCTTGATT (SEQUENCE transporting, GCAGAGGTCTG LISTING 134) lysosomal V0 GTGCCCTGTCA subunit a CCGACCTTGAC isoform 1 TCCAT (SEQUENCE LISTING 63) 64 Hs.75445 NM_004684 SPARCL1 SPARC-like 1 CTGCGAGCATC (SEQUENCE (mast9, hevin) TCTGGTGCCCA LISTING 135) TGGAACACTGC ATAACCCGTTTC TTTGA (SEQUENCE LISTING 64) 65 Hs.39957 NM_016445 PLEK2 pleckstrin 2 TGGCGTTCCCA (SEQUENCE (mouse) CTGGGGTTAAA LISTING 136) homolog GGGAATGTCCA GGGAAACCTCT TCAAAG (SEQUENCE LISTING 65) 66 Hs.65029 NM_002048 GAS1 growth arrest- CGACTACTACG (SEQUENCE specific 1 ATGAGGACTAC LISTING 137) GATGACGAGCA GCGCACCGG (SEQUENCE LISTING 66) 67 Hs.239926 NM_006745 SC4MOL sterol-C4-methyl GCTGGTTCTCG (SEQUENCE oxidase-like GCATCATGATTT LISTING 138) CCACCACATGA ACTTCATTGGAA ACTATGCTTCAA C (SEQUENCE LISTING 67) 68 Hs.59271 NM_006758 U2AF1 U2(RNU2) small TCTGTGACAAC (SEQUENCE nuclear RNA CTGGGAGACCA LISTING 139) auxillary factor 1 CCTGGTGGGGA ACGTGTACGTC AAGTTT (SEQUENCE LISTING 68) 69 Hs.8867 NM_001554 CYR61 cysteine-rich, CAAAACGCAGC (SEQUENCE angiogenic CCTGCGACCAC LISTING 140) inducer, 61 ACCAAGGGGCT GGAATGCAACT T (SEQUENCE LISTING 69) 70 Hs.50123 NM_003452 ZNF189 zinc finger CAACAGCGCAG (SEQUENCE protein 189 TCTTGTCAACCA LISTING 141) TCAGATGATCC ATGCAGAGGTG AAAACCC (SEQUENCE LISTING 70) 71 Hs.82071 NM_006079 CITED2 Cbp/p300- CACCAGATGAA (SEQUENCE interacting CGGGACAAACC LISTING 142) transactivator, AGCACTTCCGA with Glu/Asp- GATTGCAACCC rich carboxy- CAAGCA terminal domain, (SEQUENCE 2 LISTING 71)

From the above table, the HS ID of the 71 genes comprises:

Hs. 107213 Hs. 123107 Hs. 1369 Hs. 151254 Hs. 1526 Hs. 184270 Hs. 2043 Hs. 267871 Hs. 4935 Hs. 5509 Hs. 5662 Hs. 75990 Hs. 83384 Hs. 10029 Hs. 103982 Hs. 12314 Hs. 150557 Hs. 169266 Hs. 1827 Hs. 1869 Hs. 194148 Hs. 2352 Hs. 246885 Hs. 29665 Hs. 3235 Hs. 55209 Hs. 585 Hs. 62187 Hs. 63290 Hs. 699 Hs. 74111 Hs. 75103 Hs. 75117 Hs. 75236 Hs. 75258 Hs. 75498 Hs. 76913 Hs. 79889 Hs. 82173 Hs. 84072 Hs. 85146 Hs. 85844 Hs. 88219 Hs. 9914 Hs. 169319 Hs. 72805 Hs. 108301 Hs. 177926 Hs. 194746 Hs. 209061 Hs. 25087 Hs. 3017 Hs. 283664 Hs. 283664 Hs. 171992 Hs. 155172 Hs. 183418 Hs. 244473 Hs. 12835 Hs. 1650 Hs. 29981 Hs. 2246 Hs. 267871 Hs. 75445 Hs. 39957 Hs. 65029 Hs. 239926 Hs. 59271 Hs. 8867 Hs. 50123 Hs. 82071 etc.

We obtain said specific oligonucleotides sequences by using analysis of OMP (Oligonucleotide Modeling Platform, DNA Software, Inc., Ann Arbor, Mich.) DNA software, Said gene sequences can act as a reagent, a biochip and a medicine for detecting colorectal cancer shown in table 1.

According to the present invention, FIG. 2 a and FIG. 2 b are views showing the primary screening. FIG. 3 a and FIG. 3 b are views showing affirmation to genes using Northern Blotting method. FIGS. 4 a and 4 b are views showing quantity expression of cancer tissue we search over progressive distinctive new genes among the carcinoma process of colorectal cancer by using SSH method to build up CRA libraries and CRC libraries which make the comparison between adenoma, adenocarcinoma and normal tissue, that obtain over 5000 clones in per library; then randomly select about 3000 clones of cDNA from per library to dot on nylon membrane as pre-screen by using Colony Hybridization shown in FIG. 2 a and FIG. 2 b. The high expression colonies in colorectal cancer and adenoma are selected by the Colony Hybridization and then the nucleic acid of cDNA after purification spot on glass chip by using microarray testing.

The expression profiles of the cDNA chips were derived from a set of cDNA probes including adenoma, adenocarcinoma and the corresponding normal tissue from the same patient. Genes exhibiting at least three-fold greater intensities in the adenocarcinoma or adenoma than in corresponding normal tissue samples were considered significant. The significant up-regulated genes were then further confirmed by Northern blot (FIG. 3 a and FIG. 3 b) and subsequently sequenced. Northern analysis of each set of cDNA genes on the chip revealed that 36 genes were detected as up-regulated in adenoma compared to normal, and 54 genes were detected as up-regulated in carcinoma as compared to the normal control. A set of 23 genes with serial increase of genes expression from adenoma to carcinoma was identified.

Further, comparison is made by using EMBL/GenBank libraries of NCBI/BLAST program, there are 3 unknown functional genes among 23 identified genes including ectopic viral integration site 2B (Genbank accession no.NM-006495) Homo sapiens chromosome 21q22.1 anonymous mRNA sequence (Genebank accession no.AF003738) and Homo sapiens DMR protein mRNA (Genbank accession no.AF327354), and another 20 functional genes. Among these 20 functional genes, 6 genes are CRC-related (such as TM4SF3), 14 genes are CRC-unrelated (such as ATP2A2). Moreover, we obtain cDNAs of three patients who suffer from adenoma and adenocarcinama simultaneously and four colorectal cancer patients to affirm variation of 23 identified genes, result shown that were at least 3-fold higher in mRNA expression level in the adenocarcinoma tissues compared with normal samples, and the level gradually increased from colorectal adenomas to adenocarcinomas shown in FIG. 4 a and FIG. 4 b.

Now, methods of clinical diagnosis for detecting colorectal cancer are fecal occult blood test, image test, tumor label and colonoscopy. In each of these methods, we can generalize purpose of the present invention according to disadvantage of these methods.

1. Early Diagnosis

If patient undergo colorectal cancer before tumor cells spread out, five-year survival rate can be achieved over 90%. A certain number of tumor cells are needed for traditional detection by using tumor label method. In the case of image test, normally, correctly affirmation can be made easier when tumor become large. It is high invasion and price to make low acceptance for the patient in the colonoscopy that can not suitable for early diagnosis. Because of the process of circulating of tumor cells, different expression certainly happen among the genes. In the process of proliferation of early tumor cells, the dying cells cause molecule of ribonucleic acid to release into blood circulation. And, early diagnosis can be offered by the detection of using the constructed oligonucleotide biochip which is discharged from small number of tumor cells in the peripheral blood.

2. Specificity and Sensitivity

Fecal occult blood test has shortcomings for high false positives and false negatives to low specificity and sensitivity of the method, therefore the method is merely a first screening tool and the tumor label method is also not high specificity and sensitivity. But, we use these genes to detect peripheral blood of 100 CRC patients, peripheral blood of 50 healthy people and 40 other cancer-related patients as controls shown in FIG. 1, these genes can detect 88 colorectal cancer patients for remarkable sensitivity of 88% ( 88/100) and specificity of 90% ( 90/100) in the clinical analysis.

3. Safety

The colonoscopy has high invasion and price to make low acceptance for patient in the mass screening tool of early diagnosis. Because sample collection is convenience and low invasion, Peripheral blood test of patient is a diagnosis method of genes, that is suitable to mass screening clinical application.

Please refer to FIG. 5, showing another preferred embodiment according to the present invention. We choose genes of colorectal cancer and vector that express simultaneously in eukaryotic and prokaryotic to form recombination genes, and then form eukaryotic transformant cell by using and further form prokaryotic transfectant cell, and then obtain secreted protein by using extract of genes having said recombination genes, and obtain antibody from said secreted protein immune animals for making of protein testing reagent, colorectal vaccine and colorectal protein medicine for colorectal cancer.

The present invention may be embodied in other specific forms without departing from the spirit of the essential attributes thereof; therefore, the illustrated embodiment should be considered in all respects as illustrative and not restrictive, reference being made to the appended claims rather than to the foregoing description to indicate the scope of the invention.

TABLE 1 Assessment of clinical testing result for colorectal cancer biochip No. Age Sex diagnosis Stage result 1 43 F Colon cancer C1 P 2 35 F Colon cancer B2 P 3 68 M Colon adenoma P 4 56 F Colon cancer C2 P 5 32 F Colon cancer B1 N 6 65 M Colon cancer B2 P 7 46 M Colon cancer C1 P 8 67 M Colon cancer C1 P 9 58 F Colon cancer C1 P 10 45 M Colon cancer B2 P 11 62 F Colon adenoma P 12 64 F Colon cancer C2 P 13 58 F Colon cancer A N 14 76 M Colon cancer C1 P 15 38 M Colon cancer B2 P 16 67 M Colon cancer C1 P 17 86 F Colon adenoma P 18 47 F Colon cancer C2 P 19 56 M Colon cancer B1 P 20 67 F Colon cancer B2 P 21 43 F Colon adenoma P 22 65 M Colon cancer A P 23 43 F Colon cancer C2 P 24 54 M Colon cancer B1 P 25 34 F Colon cancer B2 P 26 76 F Colon adenoma P 27 66 M Colon cancer B2 P 28 78 F Colon cancer B1 P 29 57 M Colon cancer B2 P 30 74 M Colon adenoma P 31 65 F Colon cancer B1 P 32 64 F Colon cancer B2 P 33 62 M Colon cancer B1 P 34 46 M Colon cancer B2 P 35 54 F Colon cancer B1 P 36 58 F Colon cancer B1 P 37 64 F Colon adenoma P 38 56 M Colon cancer B1 P 39 67 M Colon cancer B2 P 40 48 F Colon cancer B1 P 41 55 M Colon cancer B1 P 42 64 F Colon adenoma P 43 58 F Colon cancer C1 P 44 65 M Colon cancer C1 P 45 66 M Colon cancer C2 P 46 43 M Colon cancer C1 P 47 26 F Colon cancer C2 P 48 54 M Colon cancer C1 P 49 59 F Colon cancer C2 P 50 71 F Colon adenoma N 51 37 M Colon cancer C1 P 52 47 F Colon cancer C1 P 53 62 M Colon cancer C2 P 54 47 M Colon adenoma P 55 55 F Colon cancer B2 P 56 48 M Colon cancer B1 P 57 66 F Colon cancer B2 P 58 64 M Colon cancer B1 N 59 30 M Colon cancer B1 P 60 56 F Colon cancer B2 P 61 46 M Colon cancer B1 P 62 67 F Colon cancer B2 P 63 35 M Colon cancer B1 P 64 45 F Colon cancer B1 P 65 86 F Colon cancer B2 P 66 54 M Colon cancer B1 P 67 57 M Colon cancer C1 P 68 76 F Colon cancer C2 P 69 46 M Colon cancer C1 P 70 68 M Colon cancer B2 P 71 45 F Colon cancer B1 P 72 87 M Colon cancer B1 P 73 53 M Colon cancer C1 P 74 58 F Colon cancer A P 75 54 M Colon cancer B1 P 76 67 F Colon cancer C2 P 77 56 F Colon cancer A N 78 35 M Colon adenoma P 79 79 F Colon cancer B2 P 80 82 M Colon cancer C2 P 81 76 M Colon cancer C2 P 82 54 F Colon cancer C1 P 83 42 M Colon cancer B1 P 84 68 M Colon cancer B1 P 85 27 M Colon cancer B2 P 86 67 F Colon cancer B2 P 87 46 M Colon adenoma N 88 76 F Colon cancer B1 P 89 44 MF Colon cancer B1 P 90 56 F Colon cancer B2 P 91 65 M Colon cancer C2 P 92 57 F Colon cancer C1 P 93 67 M Colon cancer B1 P 94 78 F Colon adenoma P 95 56 F Colon cancer C1 P 96 56 M Colon cancer C1 P 97 45 F Colon cancer B1 P 98 63 F Colon cancer B2 P 99 62 M Colon cancer C2 P 100 54 F Colon cancer C1 P

Controls NO Age Sex Diagnosis result 1 76 F Breast cancer N 2 35 F Breast cancer N 3 74 F Breast cancer P 4 57 F Gastric cancer N 5 87 F Breast cancer N 6 55 M Gastic cancer N 7 35 M NPC N 8 78 F Breast cancer N 9 65 M NPC N 10 55 F Breast cancer N 11 54 M NPC N 12 67 F normal N 13 86 M Gastic cancer P 14 53 F NPC N 15 58 F normal N 16 78 F Breast cancer N 17 45 M normal N 18 78 F normal N 19 87 F normal N 20 45 M normal N 

1. A method for diagnosing colorectal cancer comprising: contacting a gene consisting of a gene sequence of SEQ ID NO: 76 with a colorectal cancer containing cells; and detecting molecules of ribonucleic acid released in a peripheral blood by said colorectal cancer containing cells which reacted with SEQ ID NO: 76, thereby diagnosing the colorectal cancer containing cells. 